A Turkish Database for Psycholinguistic Studies Based on Frequency, Age of Acquisition, and Imageability

Authors

  • Elif Ahsen Acar
  • Deniz Zeyrek
  • Murathan Kurfali
  • Cem Bozsahin
Abstract

This study primarily aims to build a Turkish psycholinguistic database covering three variables: word frequency, age of acquisition (AoA), and imageability, where AoA and imageability information is limited to nouns. We used a corpus-based approach to obtain AoA information. We built two corpora: a child literature corpus (CLC) of 535 books written for children aged 3-12, and a corpus of transcribed children's speech (CSC) at ages 1;4-4;8. The word frequencies of the CLC and the CSC were positively correlated, suggesting that the CLC can be used to extract AoA information. We assumed that frequent words in the CLC would correspond to early-acquired words, whereas frequent words in a corpus of adult language would correspond to late-acquired words. To validate the AoA results from the corpus-based approach, an AoA rating questionnaire was administered to adults. Imageability values were collected from adults via a separate questionnaire. We conclude that AoA information for high-frequency words can be deduced with the corpus-based approach. The results for low-frequency words were inconclusive, which is attributed to the fact that corpus-based AoA information is affected by the strong negative correlation between corpus frequency and
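The frequency comparison between two corpora described in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the abstract does not name the correlation measure, so Pearson correlation over log relative frequencies of shared words is an assumption, the token lists are toy data, and the function names `word_frequencies`, `pearson`, and `frequency_correlation` are hypothetical.

```python
from collections import Counter
import math


def word_frequencies(tokens):
    """Return the relative frequency of each word in a list of tokens."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def frequency_correlation(tokens_a, tokens_b):
    """Correlate log relative frequencies over words found in both corpora.

    Assumption: the log transform and the restriction to shared words are
    illustrative choices, not taken from the source.
    """
    freq_a = word_frequencies(tokens_a)
    freq_b = word_frequencies(tokens_b)
    shared = sorted(set(freq_a) & set(freq_b))
    xs = [math.log(freq_a[w]) for w in shared]
    ys = [math.log(freq_b[w]) for w in shared]
    return pearson(xs, ys), shared


# Toy stand-ins for a child literature corpus and a child speech corpus.
clc_tokens = "anne anne anne top top kedi ev".split()
csc_tokens = "anne anne top top top kedi kedi ev araba".split()

r, shared_words = frequency_correlation(clc_tokens, csc_tokens)
print(shared_words, round(r, 3))
```

With real corpora, the token lists would come from tokenized (and, for Turkish, probably lemmatized) text, and a rank-based measure could be substituted for Pearson if the frequency distributions are heavily skewed.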

Similar resources

BuscaPalabras: A program for deriving orthographic and phonological neighborhood statistics and other psycholinguistic indices in Spanish

A key tool for conducting well-controlled research with linguistic stimuli in a given language is an easy-to-use, comprehensive application for computing psycholinguistic statistics. As such, there are several useful databases for computing psycholinguistic statistics in English (e.g., MRC database, Coltheart, 1981; N-Watch, Davis, 2005) and in French (Lexique, New, Pallier, Brysbaert, & Ferran...

Developing a Semantic Similarity Judgment Test for Persian Action Verbs and Non-action Nouns in Patients With Brain Injury and Determining its Content Validity

Objective: Evidence from brain trauma suggests that the two grammatical categories of noun and verb are processed in different regions of the brain due to differences in the complexity of grammatical and semantic information processing. Studies have shown that verbs belonging to different semantic categories lead to neural activity in different areas of the brain, and action verb processing is r...

Differential Weighting of Perceptual and Linguistic Factors Predicts English Noun Age-of-Acquisition

Age of acquisition (AoA) is a psycholinguistic variable that may prove useful toward gauging the relative weighting of phonological, semantic, and morphological factors at different phases of language acquisition. We examined a large corpus of English nouns using AoA as the outcome variable in three multivariate regressions, encompassing different age ranges (early-middle-late). Predictors incl...

Reading in Spanish and Italian: effects of age of acquisition in transparent orthographies?

Despite the similar transparency of their orthographies, reading in Italian has been found to be affected by frequency but not age of acquisition (AoA) [Barca, L., Burani, C., & Arduino, L. S. (2002). Word naming times and psycholinguistic norms for Italian nouns. Behaviour Research Methods, Instruments and Computers, 34, 424-434] while reading in Spanish is affected by AoA but not frequency [C...

A Lightweight Regression Method to Infer Psycholinguistic Properties for Brazilian Portuguese

Psycholinguistic properties of words have been used in various approaches to Natural Language Processing tasks, such as text simplification and readability assessment. Most of these properties are subjective, involving costly and time-consuming surveys to be gathered. Recent approaches use the limited datasets of psycholinguistic properties to extend them automatically to large lexicons. Howeve...

Publication date: 2016